Detecting Significant Multidimensional Spatial Clusters

نویسندگان

  • Daniel B. Neill
  • Andrew W. Moore
  • Francisco Pereira
  • Tom M. Mitchell
چکیده

Assume a uniform, multidimensional grid of bivariate data, where each cell of the grid has a count ci and a baseline bi. Our goal is to find spatial regions (d-dimensional rectangles) where the ci are significantly higher than expected given bi. We focus on two applications: detection of clusters of disease cases from epidemiological data (emergency department visits, over-the-counter drug sales), and discovery of regions of increased brain activity corresponding to given cognitive tasks (from fMRI data). Each of these problems can be solved using a spatial scan statistic (Kulldorff, 1997), where we compute the maximum of a likelihood ratio statistic over all spatial regions, and find the significance of this region by randomization. However, computing the scan statistic for all spatial regions is generally computationally infeasible, so we introduce a novel fast spatial scan algorithm, generalizing the 2D scan algorithm of (Neill and Moore, 2004) to arbitrary dimensions. Our new multidimensional multiresolution algorithm allows us to find spatial clusters up to 1400x faster than the naive spatial scan, without any loss of accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Space-time Analysis of Breast Cancer and Its Late-stage Cases among Iranian Women

Background Spatial scan statistic has been shown as a useful tool to investigate spatial patterns and detecting the spatial clusters of cancer. This study conducted to study spatial analysis of breast cancer and its late-stage cases, one of the most common women cancers in Iran and the world. Methods We used space-time and purely spatial scan statistic implemented in SaTScan software to detec...

متن کامل

Detecting Irregularly Shaped Significant Spatial and Spatio-Temporal Clusters

Detecting significant overdensity or underdensity clusters in spatio-temporal data is critical for many real-world applications. Most existing approaches are designed to deal with regularly shaped clusters such as circular, elliptic and rectangular ones, but cannot work well on irregularly shaped clusters. In this paper, we propose GridScan, a grid-based approach for detecting irregularly shape...

متن کامل

A genetic algorithm for spatiotemporal cluster detection and analysis

Although increased exploration of large-scale databases has provided the impetus for better detection and analysis of spatial clusters, there is slow progress in developing clustering algorithms for classifying space-time multidimensional attributes and spacetime-attribute interactions. The objective of this study is to enhance the genetic algorithm for detecting clusters in spatiotemporal or m...

متن کامل

A binary-based approach for detecting irregularly shaped clusters

BACKGROUND There are many applications for spatial cluster detection and more detection methods have been proposed in recent years. Most cluster detection methods are efficient in detecting circular (or circular-like) clusters, but the methods which can detect irregular-shaped clusters usually require a lot of computing time. METHODS We propose a new spatial detection algorithm for lattice da...

متن کامل

Investigation of Sea Surface Temperature (SST) and its spatial changes in Gulf of Oman for the period of 2003 to 2015

Considering the great application of Sea Surface Temperature (SST) in climatic and oceanic investigations, this research deals with the investigation of spatial autocorrelation pattern of SST data obtained from AVHRR sensor for Gulf of Oman from 2003 to 2015 (13 years). To achieve this aim, two important spatial statistics, i.e. global Moran and Anselin local Moran’s I were employed within mont...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004